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' A unified treatment for the existence of free energy in several 

, random energy models is presented. If the sequence of distributions 

D ■ associated with the particle systems obeys a large deviation principle, 

^"* >l | | then the free energy exists almost surely. This includes all the known 

cases as well as some heavy-tailed distributions. 

00 
(N 

1. Introduction. The purpose of this note is to bring out the essence 
involved in the existence theorems for the limiting free energy in several 
Qh ' random energy models. Basically, they are corollaries to the large deviation 

principle (LDP) obeyed by certain empirical measures coupled with Varad- 
han's lemma. Since this note is addressed to probabilists, the results are 
first formulated in the large deviation setting and then the consequences to 
the spin glass models are explained in each section. At the first sight it may 
appear that the large deviation principles proved here are nothing but the 
well known principles for empirical measures, but however a little reflection 
shows that it is not so. For large deviation terminology, we refer to [3,0]. 
qq . We could have given the most general result of section 5 and then deduced 

all the results from it. Instead, we decided to go from simple model to the 
more general one, so that it is easier for the reader to follow. 

o 



2. Random Energy Model (REM). Let (Ajv, N > 1) be a sequence 
of probabilities on the real line M. Assume that A at =>■ A and satisfy large 
deviation principle with a convex rate function X{x) on R. For every N, 
suppose < i < 2 N be i.i.d. random variables having distribution Ajy. 
Define for each u, un(uj) to be the empirical measure, namely ^n{<^) = 

^ £*(6M)- 

Theorem 2.1. For a.e. to the above sequence {hn(lo)} satisfies LDP 
with rate function J given by, 

J(x) = l(x) if J(x) < log 2 
= oo if 1{x) > log 2. 
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PROOF. Step 1: Let A be a subinterval ofR. If 2 Aat(A) < oo, then 
almost surely eventually ^v(A) = 0. 

Indeed, using P for the probability on the space where the random variables 
are defined, 

P(^v(A) > 0) = P(ai e A for some i) < 2 N P N (A). 

Use Borel - Cantelli. 

Step 2: Let A be a subinterval of R. If 2 A 'A^(A) < 00 > ^ ien f or an ^ 
e > almost surely eventually 

(1 - e)Ajv(A) < m (A) < (1 + e)Ajv(A). 

Indeed, 

Var m (A) = E (-L £ l^tf - \%(A) < ±\ N (A). 
Chebyshev yields 

P(|MA) - A iV (A)| > eAjv(A)) < ■ 
Use Borel-Cantelli. 



Step 3: Fix an interval A for which log Aat(A) has a limit, say, — L < 

— log 2. Fix a > such that —L<— log 2 — a. Then for sufficiently large 
N, ilogAAr(A) < - log 2 -a, that is, Aat(A) < 2- N e~ Na . In other words, 
2 N X N (A) < e~ Na for all large N. Thus by Step 1, a.s. eventually /zjv(A) = 0. 
So ^log^Af(A) = — oo a. s. eventually. 

Step 4: Fix an interval A for which log Aat(A) has a limit, say, —L > 

- log 2. Fix a > so that — L > - log 2 + a. So for large AT, log Aat(A) > 
-log 2 + a, that is, Ajv(A) > 2~ N e Na . In other words, 2 7V A JV (A) > e Na . 
Now use Step 2 with e = ^ and take logarithms to get almost surely, 

lim -j-log/ijv(A) = -L= lim -|- log Ajv(A) 

iv — >oo iv N-^oo iv 

Proof of the theorem is completed as follows. X being convex, clearly the 
set {x : T(x) = log 2} has at most two points. Let now A be an open in- 
terval bounded away from these two points. If A C {x : I(x) < log 2} then 
lmijv^oo log ^at(A) = — inf xg A^(^) = — 2T(A) by Step 4, where as, if 
A C {x : T(x) > log2} then limA^oo log^Ar(A) = — oo by Step 3. By 
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theorem 4.1.11 in J is the rate function for the sequence {^(u;)} for 
all most every uj. 

The implications for REM [5] are clear. Here for fixed N, one denotes the 
random variables £ by Hjq{a) indexed by a G {+1, — 1}^. One considers 
the partition function Z N (/3) = £ e -PH N (*) = 2 N E a e~ l3HN where E a is the 

expectation with respect to the uniform probability on the a space. The 
limit, lim i log Zjq{f3) when exists, is called the free energy of the system. 

N — >oo 

Now suppose {\n} is a sequence of probabilities as above and H^{a) = 
N^^{a) where for fixed N, £at(ct) are i.i.d. (as a varies) with distribution 
A at. Let us assume that J has compact support. This is ensured by assuming 
I to be a good rate function [4] or at least {x : I(x) < log 2} is bounded. 

The case when A at is centered Gaussian with variance i translates to Hn 
being of variance N. This is the classical case |! HQ. Here T(x) = £ . As 



a result, for almost every u, the rate function of the sequence {/ijv(w)} is 

2 2 

T(x) = %- if %- < log 2 and oo otherwise. This in turn implies, by Varadhan's 
lemma Q, H, 12], lim -k log Z^(S) = log 2 — inf {/3x + which can 

AT JV x 2 <21og2 ^ 

easily be evaluated. 

The case when Aat has density ye - ^^ for — oo < y < oo corresponds 



to .Hat being two sided exponential with parameter one considered in 11]. 
Weibull distributions can also be considered is clear that sym- 

metry of the distributions does not play any role. It is also easy to see that 
when the random variables are, moreover, non negative then there is no 
phase transition. 

3. Generalized Random Energy Model (GREM). The setup is 
the following. Let n > 1 be a fixed integer. For each j, 1 < j < n, we have 
a sequence of probabilities {X J N ,N > n} on R which weakly converges to 5o 
and obey LDP with a convex rate function Ij. Denote {+1,-1}^ by Sat. 
For each N, let k(l,N), . . . , k(n,N) be non-negative integers adding to N 
and put SjTv = {+L —l} k V' N \ Clearly, Sat = Siat x X^at x • • • x E n Af and we 
express cr G Sjv as <7i<72 with cr, G Xiiv, in an obvious way. For fixed 

TV" we have a bunch of independent random variables as follows: {£(01) : 
o\ G Siat} having distributions Ajy, {^(o"iO"2) : ^"2 G X^a^ci G Xiat} having 
distributions A^ and in general {^(o"io"2 • ■ ■ Oj-\Gj) : <Tj G SjjVj ■ • • , cri G 
Siat} having distribution X 3 N . 

Define for each to, hn{w) to be the empirical measure on M n , namely, 

WvM = ^n^S (£{<Tl,U)),Z((Ti(T2,w), - ■ ■ ,£(<7i •••cr n ,u/)) 
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where 5 (x) denotes the point mass at x G W 1 . 

Theorem 3.1. Suppose k ^"> — > Pj > /or 1 < j < n. Then for a.e. uj, 
the sequence {/Utv(w), N > n} satisfies LDP with rate function J given as 
follows: 

j j 

Supp(J) = {(xi, ■ ■ ■ , x n ) : Y, Zk{xk) < T, Pk log 2 for 1 < j < n} 

fc=l fe=l 

and 

n 

^0*0 = E ^fc(^fc) ifx£Supp(J) 
fe=l 

= oo otherwise 

Proof. The proof proceeds as in Theorem 2.1, we only explain the steps 
involved. In what follows A denotes a box in M n with sides A^; 1 < j < n 
where each Aj is an interval. 

Step 1: If for some j, 

j 



£n 2fcM ° A MA 4 )<oc 



N>ni=l 



then a.s. eventually /xjv(A) = 0. 
Step 2: If for each j, 

j 1 

II nk(i,N)\i (A.\ < °° 

then for any e > a.s. eventually 

(1 - e) f[ AV(A,) < MJV (A) < (1 + e) ft 4r(A*)- 

i=l i=l 



This step involves calculation of Var (/xjv(A)) which is carried out in a 
more general set up in section 5. The reader may also consult BSQ. The 
remaining two steps are accordingly modified. 

The implications for GREM[6] are clear. For fixed N, and a £ £tv one 
defines the Hamltonian 

n 

H N {a) = Jv£a i £(<7 1 ---ff i ). 

i=i 

Here dj, 1 < i < n are positive numbers called weights. In the Gaussian 
case, it is customary to take Y a i = 1> though it is not a mathematical 
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necessity. As earlier, Zn((3) = J2a e @ Hn ^ . Special choices of lead to 
all the known models considered. Centered Gaussians were consider in 0, 
. More general distributions as well as the cases when some pj are 
zero were considered in [121. Moreover one could take different distributions 



for different values of j, see 13j for some interesting consequences. Thus the 
main problem of GREM is reduced to a variational problem. Note that, if 
n = 1, GREM reduces to REM. 

4. Bolthausen-Kistler Model (BKM). We follow the same notation 
as in the previous section for Sjv, Sjjv, k(i, N) and a = o\Oi ■ ■ ■ a n . Let I = 
{1, 2, • • • , n} and S be the collection of non-empty subsets of /. Suppose that 
for each non-empty s C /, we have a sequence of probabilities {X S N : N > 
n} weakly converging to <5o and obeying LDP with a convex rate function 
T s (x). For s = ,ik} £ S where i\ < %i < ■ ■ ■ < i^, we denote 

o~(s) = Oi^Oi^ - ■ ■ Oi k . Now for fixed N, we have a bunch of independent 
random variables £(s,er(s)) as s and a(s) vary. For s G S, all the £(s,cr(s)) 
have distribution A^. We define the empirical measure on R s , by 

WvM = 2iV Yj 6 (^( s > °"( s )> w ) : s G S ) • 

Theorem 4.1. Suppose — » > /or 1 < j < n. T/ien /or a.e. 
i/ie sequence {^in(uj), N > n} satisfies LDP with rate function J given as 
follows: 

Supp(J) = {{x s , seS): Vs e S, £ 2*(x*) < £ Pfc log 2} 

tCs k£s 

and 

J( x ) = E 1s(x s ) if x = (x s ) G Supp(J) 
= oo otherwise 



The proof proceeds along the same lines as in Theorem 3.1. The only 
complications, again, are in calculating Var {in (A). See next section. 

In BKmQ], we have Hamiltonian H^{a) = N a s£,( s , °~{ s )) where a s , s G 

seS 

S are non-negative weights. Again if J has compact support, Varadhan's 
lemma reduces calculation of free energy to that of a variational problem. In 
BKM, they do not consider all non-empty subsets of I, rather, some sub col- 
lection. But that can be achieved by taking some weights a s to be zero appro- 
priately. Further, if one considers a chain like {1}, {1, 2}, • • • , {1, 2, • • • , n} = 
/ and takes non zero weights a s for s only in the chain, one gets GREM. 
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5. Models with External Field. The model described in this section 
includes all the above and also incorporates external field. To describe the set 
up, we will use the same notation as above except for S and s. Here the set of 
all non-empty ordered sequences s = (ii, 12, ■ ■ ■ , ik) of distinct elements from 
7, having length < n will be denoted as S. We use the same notation of o~(s) 
as in the previous section. For s £ S, we have a sequence of probabilities 
{A^, N > n} weakly converging to do obeying LDP with a convex rate 
function l s . As earlier, for fixed N, we have a bunch of independent random 
variables £(s,<r(s)) as s and a(s) vary. For s € S, all the £(s,<r(s)) have 
distribution X S N . Let cfj denote the sum of the k(i,N) many +1 and — 1 
appearing in o"j. We define the empirical measure on x M. n by 

Vn(w) = ^n^2 S cr ( s )> w ) : s G S) , : l<i<njY 

Points in M 5 x R n will be denoted by ((x s ,s 6 S),(yi,i < n)) or simply as 
(x s ,y z ). For i C I, we will denote Sa to be the set of non-empty ordered 
sequences of distinct elements from A. 

First note that, by Cramer's theorem 0], the arithmetic averages of i.i.d. 
mean zero, ±1 valued random variables satisfy LDP with rate function Xq 
where Io{y) = 00 for \y\ > 1 and for —1 < y < 1, 

^o(y) = y tanh^ 1 y — log cosh (tanh -1 y) 
= i+K]og(l + y) + ^log(l-y). 

Theorem 5.1. Suppose — > > /or 1 < j < n. T/ien /or a.e. 
i/ie sequence {^n(uj), N > n} satisfies LDP with rate function J given as 
follows: 

Supp(J) = Ux s , yi ) : VA C 7, E Zi(s t ) + E Pi^o ft) < E Pfclog2l 
{ teS A i&A fceA J 

and 

J(x s ,yj) = E 2;0s) + EPi^o ft) if (x s , yi ) € Supp(J) 
seS iel K ' 

= 00 otherwise 

Proof. In what follows, □ = ILes A s x Vu=\ v i is a box in R s x M n 
where A s for each s £ <5 and Vj for i < n are subintervals of R. Also ^4 
denotes non empty subset of 7. For A C 7, we denote Qa/v = II ^at(^s); 

fc(A,JV)= £fc(t,JV) anda AJV = ^y E II Mft). 

igA (cr;: ieA) «SA 
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Step 1: If for some A C I, J2N>n2 k< " A ' N ^QANC(AN < oo then almost 
surely eventually /Ltjv(n) = 0. 

Let A be such that J2n>u ^ A ' N) Qanolan < oo. Then 

md) =F^n ia. *(*))) n iv 4 (f 

<r seS i<n v 

n ia.(£0>, *(*))) n iv.f^) 
= 2™ e n iA s (e( s ,d( s ))) n ivJl) 

As a consequence, 

p( / x A t(d)>o) =p( e n iaAsm*))) n lv 4 (^) >lj 

<Qm e n lv 4 (f ) 

(o-jrisA) i&A x ' 

The hypothesis and Borel-Cantelli lemma complete the proof of the claim. 

Step 2: If for all A C I, T,N>n 2 ~ k{A ' N) QAN a AN < 00 then f or an y 
e > o, almost surely eventually, 

(1 - e)Q IN a IN < /Xjv(D) < (1 + e)Q IN a IN . 
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Note that 
Var(/xjv(n)) 

= ^EEe(j[i As 1a. (*(a,r(*)))) n iv, lv, 

—Q 2 IN a2 IN 

< 2^r E E E £;(niA s (^,cr( S )))l As (^,r( S )))] X 

n iv. (t) iv, 



7^ cr^ , Vi £ A 



Here is how we get the inequality above. The terms appearing in the first 
expression corresponding to cij ^ Tj for alH € I are canceled with Q 2 IN a 2 IN . 
Now by Chebyshev's inequality 

P(|MD) " ^0=01 > eE m (D)) < 1 ]T nk(AN)n ■ 

e X^j 2 k{ ~ A ^)Q AN a A N 

Once again Borel-Cantelli lemma and the hypothesis yield that a.s. eventu- 
ally 

(1 - e)Efi N (D) < n N (D) < (1 + e)E m (D). 
Since E/j,n(0) = Qinolin, proof of the claim is complete. 

Step 3: If a box □ C R x lR n has empty intersection with Supp(J'), then 
by Step 1, almost surely eventually /xjv(D) = and hence lim jt log///v(D) = 

— CO. 

Step 4: Suppose that D C I s x 1™ has non-empty intersection with the 
interior of Supp(J). Then by Step 2, almost surely, lim 4Tog/Xjv(D) = 

JV— »oo 



lim 4r log QinQ^in = 

N — >OC 

has only to decipher Qin^in- 



s€S l<i<n 



. To see this, one 
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Since there are enough boxes □ - by convexity of I s - satisfying conditions 
of either Step 3 or Step 4 to form a basis, the proof is completed as in 
Theorem 2.1 using Theorem 4.1.11 in [4|. 

Implication of the above result will be clear if one defines the Hamiltonian 
Hn{c) = N J2 a sli{s,cr(s)) + ha where a s , s € S are non-negative weights 

and h > is the strength of the external field. Here also if J has compact 
support (which is true when the rate functions X s are convex), Varadhan's 
lemma reduces calculation of free energy to that of a variational problem. Of 
course, it is not always possible to solve this variational problem to arrive 
at a closed form expression. When the external field is and when only 
increasing sequences have positive weight, then this reduces to BKM. 

6. Remarks. 1. As the reader would have noticed, convexity of the 
rate function I is not essential. For example, the results are valid if the rate 
function strictly decreases on (— oo,0) and strictly increasing on (0,oo). In 
fact, this will then include the Weibull distribution with shape parameter 
smaller than one. 

2. The factor 2^ could easily be replaced by k (k integer > 1) with 
appropriate changes in the theorems. 

3. We do not have any precise formulation of the results for GREM with 
n = oo [3 1 ]. Perhaps these arguments do not work for SK-model. 
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